311 research outputs found

    QTLMAS 2010: simulated dataset

    Get PDF

    Potentiality in Biology

    Get PDF
    We take the potentialities that are studied in the biological sciences (e.g., totipotency) to be an important subtype of biological dispositions. The goal of this paper is twofold: first, we want to provide a detailed understanding of what biological dispositions are. We claim that two features are essential for dispositions in biology: the importance of the manifestation process and the diversity of conditions that need to be satisfied for the disposition to be manifest. Second, we demonstrate that the concept of a disposition (or potentiality) is a very useful tool for the analysis of the explanatory practice in the biological sciences. On the one hand it allows an in-depth analysis of the nature and diversity of the conditions under which biological systems display specific behaviors. On the other hand the concept of a disposition may serve a unificatory role in the philosophy of the natural sciences since it captures not only the explanatory practice of biology, but of all natural sciences. Towards the end we will briefly come back to the notion of a potentiality in biology

    Bayesian estimates of linkage disequilibrium

    Get PDF
    [Background] The maximum likelihood estimator of D' – a standard measure of linkage disequilibrium – is biased toward disequilibrium, and the bias is particularly evident in small samples and rare haplotypes. [Results] This paper proposes a Bayesian estimation of D' to address this problem. The reduction of the bias is achieved by using a prior distribution on the pair-wise associations between single nucleotide polymorphisms (SNP)s that increases the likelihood of equilibrium with increasing physical distances between pairs of SNPs. We show how to compute the Bayesian estimate using a stochastic estimation based on MCMC methods, and also propose a numerical approximation to the Bayesian estimates that can be used to estimate patterns of LD in large datasets of SNPs. [Conclusion] Our Bayesian estimator of D' corrects the bias toward disequilibrium that affects the maximum likelihood estimator. A consequence of this feature is a more objective view about the extent of linkage disequilibrium in the human genome, and a more realistic number of tagging SNPs to fully exploit the power of genome wide association studies.Research supported by NIH/NHLBI grant R21 HL080463-01, NIH/NIDDK 1R01DK069646-01A1 and the Spanish research program [projects TIN2004-06204-C03-02 and TIN2005-02516]

    Inference of population splits and mixtures from genome-wide allele frequency data

    Full text link
    Many aspects of the historical relationships between populations in a species are reflected in genetic data. Inferring these relationships from genetic data, however, remains a challenging task. In this paper, we present a statistical model for inferring the patterns of population splits and mixtures in multiple populations. In this model, the sampled populations in a species are related to their common ancestor through a graph of ancestral populations. Using genome-wide allele frequency data and a Gaussian approximation to genetic drift, we infer the structure of this graph. We applied this method to a set of 55 human populations and a set of 82 dog breeds and wild canids. In both species, we show that a simple bifurcating tree does not fully describe the data; in contrast, we infer many migration events. While some of the migration events that we find have been detected previously, many have not. For example, in the human data we infer that Cambodians trace approximately 16% of their ancestry to a population ancestral to other extant East Asian populations. In the dog data, we infer that both the boxer and basenji trace a considerable fraction of their ancestry (9% and 25%, respectively) to wolves subsequent to domestication, and that East Asian toy breeds (the Shih Tzu and the Pekingese) result from admixture between modern toy breeds and "ancient" Asian breeds. Software implementing the model described here, called TreeMix, is available at http://treemix.googlecode.comComment: 28 pages, 6 figures in main text. Attached supplement is 22 pages, 15 figures. This is an updated version of the preprint available at http://precedings.nature.com/documents/6956/version/

    Simultaneous Analysis of All SNPs in Genome-Wide and Re-Sequencing Association Studies

    Get PDF
    Testing one SNP at a time does not fully realise the potential of genome-wide association studies to identify multiple causal variants, which is a plausible scenario for many complex diseases. We show that simultaneous analysis of the entire set of SNPs from a genome-wide study to identify the subset that best predicts disease outcome is now feasible, thanks to developments in stochastic search methods. We used a Bayesian-inspired penalised maximum likelihood approach in which every SNP can be considered for additive, dominant, and recessive contributions to disease risk. Posterior mode estimates were obtained for regression coefficients that were each assigned a prior with a sharp mode at zero. A non-zero coefficient estimate was interpreted as corresponding to a significant SNP. We investigated two prior distributions and show that the normal-exponential-gamma prior leads to improved SNP selection in comparison with single-SNP tests. We also derived an explicit approximation for type-I error that avoids the need to use permutation procedures. As well as genome-wide analyses, our method is well-suited to fine mapping with very dense SNP sets obtained from re-sequencing and/or imputation. It can accommodate quantitative as well as case-control phenotypes, covariate adjustment, and can be extended to search for interactions. Here, we demonstrate the power and empirical type-I error of our approach using simulated case-control data sets of up to 500 K SNPs, a real genome-wide data set of 300 K SNPs, and a sequence-based dataset, each of which can be analysed in a few hours on a desktop workstation

    A New Method to Reconstruct Recombination Events at a Genomic Scale

    Get PDF
    Recombination is one of the main forces shaping genome diversity, but the information it generates is often overlooked. A recombination event creates a junction between two parental sequences that may be transmitted to the subsequent generations. Just like mutations, these junctions carry evidence of the shared past of the sequences. We present the IRiS algorithm, which detects past recombination events from extant sequences and specifies the place of each recombination and which are the recombinants sequences. We have validated and calibrated IRiS for the human genome using coalescent simulations replicating standard human demographic history and a variable recombination rate model, and we have fine-tuned IRiS parameters to simultaneously optimize for false discovery rate, sensitivity, and accuracy in placing the recombination events in the sequence. Newer recombinations overwrite traces of past ones and our results indicate more recent recombinations are detected by IRiS with greater sensitivity. IRiS analysis of the MS32 region, previously studied using sperm typing, showed good concordance with estimated recombination rates. We also applied IRiS to haplotypes for 18 X-chromosome regions in HapMap Phase 3 populations. Recombination events detected for each individual were recoded as binary allelic states and combined into recotypes. Principal component analysis and multidimensional scaling based on recotypes reproduced the relationships between the eleven HapMap Phase III populations that can be expected from known human population history, thus further validating IRiS. We believe that our new method will contribute to the study of the distribution of recombination events across the genomes and, for the first time, it will allow the use of recombination as genetic marker to study human genetic variation

    A Bayesian Outlier Criterion to Detect SNPs under Selection in Large Data Sets

    Get PDF
    Background: The recent advent of high-throughput SNP genotyping technologies has opened new avenues of research for population genetics. In particular, a growing interest in the identification of footprints of selection, based on genome scans for adaptive differentiation, has emerged.[br/] Methodology/Principal Findings: The purpose of this study is to develop an efficient model-based approach to perform Bayesian exploratory analyses for adaptive differentiation in very large SNP data sets. The basic idea is to start with a very simple model for neutral loci that is easy to implement under a Bayesian framework and to identify selected loci as outliers via Posterior Predictive P-values (PPP-values). Applications of this strategy are considered using two different statistical models. The first one was initially interpreted in the context of populations evolving respectively under pure genetic drift from a common ancestral population while the second one relies on populations under migration-drift equilibrium. Robustness and power of the two resulting Bayesian model-based approaches to detect SNP under selection are further evaluated through extensive simulations. An application to a cattle data set is also provided.[br/] Conclusions/Significance: The procedure described turns out to be much faster than former Bayesian approaches and also reasonably efficient especially to detect loci under positive selection

    Depression and sickness behavior are Janus-faced responses to shared inflammatory pathways

    Get PDF
    It is of considerable translational importance whether depression is a form or a consequence of sickness behavior. Sickness behavior is a behavioral complex induced by infections and immune trauma and mediated by pro-inflammatory cytokines. It is an adaptive response that enhances recovery by conserving energy to combat acute inflammation. There are considerable phenomenological similarities between sickness behavior and depression, for example, behavioral inhibition, anorexia and weight loss, and melancholic (anhedonia), physio-somatic (fatigue, hyperalgesia, malaise), anxiety and neurocognitive symptoms. In clinical depression, however, a transition occurs to sensitization of immuno-inflammatory pathways, progressive damage by oxidative and nitrosative stress to lipids, proteins, and DNA, and autoimmune responses directed against self-epitopes. The latter mechanisms are the substrate of a neuroprogressive process, whereby multiple depressive episodes cause neural tissue damage and consequent functional and cognitive sequelae. Thus, shared immuno-inflammatory pathways underpin the physiology of sickness behavior and the pathophysiology of clinical depression explaining their partially overlapping phenomenology. Inflammation may provoke a Janus-faced response with a good, acute side, generating protective inflammation through sickness behavior and a bad, chronic side, for example, clinical depression, a lifelong disorder with positive feedback loops between (neuro)inflammation and (neuro)degenerative processes following less well defined triggers

    Search for the Decays B^0 -> D^{(*)+} D^{(*)-}

    Full text link
    Using the CLEO-II data set we have searched for the Cabibbo-suppressed decays B^0 -> D^{(*)+} D^{(*)-}. For the decay B^0 -> D^{*+} D^{*-}, we observe one candidate signal event, with an expected background of 0.022 +/- 0.011 events. This yield corresponds to a branching fraction of Br(B^0 -> D^{*+} D^{*-}) = (5.3^{+7.1}_{-3.7}(stat) +/- 1.0(syst)) x 10^{-4} and an upper limit of Br(B^0 -> D^{*+} D^{*-}) D^{*\pm} D^\mp and B^0 -> D^+ D^-, no significant excess of signal above the expected background level is seen, and we calculate the 90% CL upper limits on the branching fractions to be Br(B^0 -> D^{*\pm} D^\mp) D^+ D^-) < 1.2 x 10^{-3}.Comment: 12 page postscript file also available through http://w4.lns.cornell.edu/public/CLNS, submitted to Physical Review Letter
    corecore